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Summary. The price of electricity is far more volatile than that of other commodities normally 
noted for extreme volatility. The possibility of extreme price movements increases the risk of 
trading in electricity markets. However, underlying the process of price returns is a strong mean- 
reverting mechanism. We study this feature of electricity returns by means of Hurst R/S analysis, 
Detrended Fluctuation Analysis and periodogram regression. 

Key words. Long-range dependence. Electricity price. Hurst exponent. Mean-reversion 

1. Introduction 

There exists a strong evidence that price processes of financial assets should not be modeled by 
simple random walks (Bouchaud and Potters 1997, Weron and Weron 1998, Mantegna and Stanley 
1999). These processes seem to be persistent with memory lasting up to a few years (Peters 1994) 
and possess a non-trivial autocorrelation structure (Dacorogna et al. 1993, Guillaume et al. 1997). 
Recently it has been observed that, contrary to most financial assets, electricity price processes 
are mean-reverting or anti-persistent (Pilipovic 1998, Kaminski 1999, Weron 2000, Weron and 
Przybylowicz 2000). In this paper we investigate it more thoroughly by means of Hurst R/S 
analysis, Detrended Fluctuation Analysis and periodogram regression. 

2. Power markets 

The last decade has witnessed radical changes in the structure of electricity markets world-wide. 
Prior to the 1980s it was argued convincingly that the electricity industry was a natural monopoly 
and that strong vertical integration was an obvious and efficient model for the power sector. In 
the 1990s, technological advances suggested that it was possible to operate power generation and 
retail supply as competitive market segments (International Chamber of Commerce 1998, Masson 
1999). 

Changes came slowly at first, reflecting industry concern that competition and system security 
were mutually exclusive. Early experiments - in Scandinavia and in the UK - demonstrated clearly 
that the lights did not go out with the institution of competition. 

The deregulation process has recently intensified in Europe and North America, where market 
forces have pushed legislators to begin removing artificial barriers that shielded electric utilities 
from competition. Organizations which have been used to long-term fixed price contracts are 
now becoming increasingly exposed to price volatility and, of necessity, are seeking to hedge and 
speculatively trade to reduce their exposure to price risk. However, we have to bear in mind that 
electricity markets are not anywhere near as straightforward as financial or even other commodity 
markets. They have to deal with the added complexity of physical substance, which cannot simply 
be manufactured, transported and delivered, at the press of a button. 

Unlike other commodities, electricity cannot be stored efficiently. Therefore, a delicate balance 
must be maintained between generation and consumption - 24 hours a day, 7 days a week, 52 
weeks a year. Electric power may be generated from natural gas, coal, oil, nuclear fuel, falling 
water, geothermal steam, alternative resources such as cogeneration, and from renewable resources 
such as wind power, solar energy and biomass. Although the principles of generating electricity 



are simple, generating electricity for a country or a state the size of California, both in terms of 
geographic area and population, means a complex balancing process. Naturally, this has big impact 
on electricity prices and results in behavior not observed in the financial or even other commodity 
markets. It is thus extremely interesting to investigate the newly established power markets. 



3. Estimation of long-range dependence 

In economics and finance, long-range dependence has a long history (for a review see Baillie and 
King (1996) and Mandelbrot (1997)) and still is a hot topic of active research (Lux 1996, Lobato 
and Savin 1998, Willinger et al. 1999, Grau-Carles 2000). Historical records of economic and 
financial data typically exhibit nonperiodic cyclical patterns that are indicative of the presence of 
significant long memory. However, the statistical investigations that have been performed to test 
long-range dependence have often become a source of major controversies, especially in the case of 
stock returns. The reason for this are the implications that the presence of long memory has on 
many of the paradigms used in modern financial economics (Lo 1991). 

Various estimators of long-range dependence have been proposed. In this paper we apply 
rescaled range analysis, Detrended Fluctuation Analysis and periodogram regression to measure 
long memory in electricity prices. 



3.1 R/S analysis 

We begun our investigation with one of the oldest and best-known methods, the so-called rescaled 
range or R/S analysis. This method, proposed by Mandelbrot and Wallis (1969) and based on 
previous hydrological analysis of Hurst (1951), allows the calculation of the self-similarity parameter 
H, which measures the intensity of long-range dependence in a time scries. 

The analysis begins with dividing a time series (of returns) of length L into d subseries of length 
n. Next for each subseries m = 1, ...,d: 1° find the mean (Em) and standard deviation (5™); 2° 
normalize the data (Zi^ra) by subtracting the sample mean m — Zi.m — -^m for % — 1, 775 
3° create a cumulative time series y^.m = for i = l,...,n; 4° find the range Rm = 

maxjli.m, Yn^rn\ ~ niin{Yi.m, Yn^m}] and 5° rescale the range Rm/ Sm- Finally, calculate the 
mean value {R/S)n of the rescaled range for all subseries of length n. 

It can be shown that the R/S statistics asymptotically follows the relation {R/S)n ^ cn^ . Thus 
the value of H can be obtained by running a simple linear regression over a sample of increasing 
time horizons 

logiR/S)n^logc + Hlogn. (1) 

Equivalently, we can plot the {R/ S)n statistics against 71 on a double-logarithmic paper. If the 
returns process is white noise then the plot is roughly a straight line with slope 0.5. If the process 
is persistent then the slope is greater than 0.5; if it is anti-persistent then the slope is less than 0.5. 

However, it should be noted that for small n there is a significant deviation from the 0.5 
slope. For this reason the theoretical (i.e. for white noise) values of the R/S statistics are usually 
approximated by 

^^EV^ for «< 340, 

m/s)n^{ , l^r'r— (2) 

for n > 340, 



where F is the Euler gamma function. This formula is a slight modification of the formula given by 
Anis and Lloyd (1976); the (n— 5)/"- term was added by Peters (1994) to improve the performance 
for very small n. 

Formula (^) was used as a benchmark in all empirical studies in this paper, i.e. the Hurst 
exponent H was calculated as 0.5 plus the slope of {R/S)n — E{R/S)n- The resulting statistics 
was denoted by R/S-AL. 

A major drawback of the R/S analysis is the fact that no asymptotic distribution theory has 
been derived for the Hurst parameter H. The only known results are for the rescaled (but not 




by standard deviation) range Rm itself (Lo 1991). However, recently Weron (2001) has obtained 
empirical confidence intervals for the R/S statistics via a Monte Carlo study. We will use these 
values in the next Section. 



3.2 Detrended Fluctuation Analysis 

The second method we used to measure long-range dependence is the Detrended Fluctuation 
Analysis (DFA) proposed by Peng et al. (1994). The advantage of DFA over R/S analysis is that 
it avoids spurious detection of apparent long-range correlation that is an artifact of non-stationarity. 
The method can be summarized as follows. Divide a time series (of returns) of length L into d 
subseries of length n. Next for each subseries m = l,...,d: 1° create a cumulative time series 
^i,m = J2]=i ^j,m for i = 1, n; 2° fit a least squares line Yrn{x) = am.x + bm to Yn,m}; 
and 3° calculate the root mean square fluctuation (i.e. standard deviation) of the integrated and 
detrended time series 



F{m) 



^^(nm-«m^-M'- (3) 
i=l 

Finally, calculate the mean value of the root mean square fluctuation for all subseries of length n 

m=l 

Like in the case of R/S analysis, a linear relationship on a double-logarithmic paper of F{n) against 
the interval size n indicates the presence of a power-law scaling of the form cn^ (Peng ct al. 1994, 
Taqqu et al. 1995). If the returns process is white noise then the slope is roughly 0.5. If the 
process is persistent then the slope is greater than 0.5; if it is anti-persistent then the slope is less 
than 0.5. 

Unfortunately, no asymptotic distribution theory has been derived for the DFA statistics so 
far. However, like for the R/S analysis, Weron (2001) has obtained empirical confidence intervals 
for the DFA statistics via a Monte Carlo study. We will use these values in the next Section. 



3.3 Periodogram regression 

The third method is a semi-parametric procedure to obtain an estimate of the fractional differencing 
parameter d. This technique, proposed by Geweke and Porter-Hudak (1983) and denoted GPH 
in the text, is based on observations of the slope of the spectral density function of a fractionally 
integrated series around the angular frequency a; = 0. Since the spectral density function of a 
general fractionally integrated model (eg. FARIMA) with differencing parameter d is identical to 
that of a fractional Gaussian noise with Hurst exponent H = d + 0.5, the GPH method can be 
used to estimate H. 

The estimation procedure begins with calculating the periodogram, which is a sample analogue 
of the spectral density. For a vector of observations {xi, xl} the periodogram is defined as 

2 

(5) 

where iVk = k/L, k = 1, [L/2] and [x] denotes the largest integer less then or equal to x. The 
next and final step is to run a simple linear regression 

\og{lL{ujk)} = a - dlog{Asm'^{LUk/2)} + Ek, (6) 

at low Fourier frequencies LOk, k = 1,...,K < [L/2]. The least squares estimate of the slope yields 
the differencing parameter d through the relation d = d, hence H = d + 0.5. A major issue on 
the application of this method is the choice of K. Geweke and Porter-Hudak (1983), as well as a 
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Fig. 1. CalPX market daily average clearing prices since April 1st, 1998 until December 31st, 2000. Note 
the different scale in the bottom panel. 



number of other authors, recommend choosing K such that K — [L^'^], however, other values (eg. 
K = [L°-^% <K< [LO-5]) have also been suggested. 

Periodogram regression is the only of the presented methods, which has known asymptotic 
properties. Inference is based on the asymptotic distribution of the estimate 

d^N{d, ), (7) 

where Xt = log{4 sin^(ajfc/2)} is the regressor in eq. (^. 



4. Empirical analysis 

The first analyzed database was obtained from the University of California Energy Institute 
(UCEI). Among other data it contains market clearing prices from the California Power Exchange 
(CalPX) - a time series containing system prices of electricity for every hour since April 1st, 1998, 
0:00 until December 31st, 2000, 24:00. Because the series included a very strong daily cycle we 
created a 1006 days long sequence of average daily prices and plotted it in Fig. 1. The price 
trajectory suggests that the process does not exhibit a regular annual cycle. Indeed, since June 
2000, California's electricity market has produced extremely high prices and threats of supply 
shortages.^] We decided to treat this period as an anomaly and remove it from the data when 
measuring long-range dependence. Further analysis was conducted on a 731 days long sequence of 
average daily prices covering the period April 1st, 1998 - March 31st, 2000, i.e. two full years. 

All other analyzed time series were kindly provided by Bridge Information Systems. Most of the 
data sets included electricity prices since January 1st, 1998 until September 30th, 2000, however, 
for the analysis we selected data as indicated below: 

^Thc difficulties that have appeared are intrinsic to the design of the market, in which demand exhibits virtually 
no price responsiveness and supply faces strict production constraints. As yet there is no happy end to this story. 
On January 30th, 2001 the exchange suspended trading because it could not comply with FERC's (Federal Energy 
Regulatory Commission) directive not to allow bidding that is inconsistent with the mandated $150 breakpoint. 
Five weeks later, on March 9th, 2001, the California Power Exchange filed for Chapter 11 protection with the U.S. 
Bankruptcy Court. This was a serious blow to all protagonists of the power market liberalization. 
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Fig. 2. Nord Pool market daily average system prices since January 1st, 1998 until September 30th, 2000. 
The annual Scandinavian cycle (low prices in summer, high in winter) can be seen easily. 



• a 731 days long sequence of Nord Pool (Nordic Power Exchange) average daily system prices 
(electricity) for the period April 1st, 1998 - March 31st, 2000 (to be consistent with CalPX 
data); the full series is plotted in Fig. 2; 

• a 695 days long sequence of daily spot market closing prices (excluding weekends and holidays) 
for firm on-peak power in the Entergy region (Louisiana, Arkansas, Mississippi and East 
Texas) for the period January 2nd, 1998 - September 29th, 2000; 

• a 295 days long sequence of Telerate day-ahead U.K. electricity index (Monday through 
Friday only, including holidays) for the period September 1st, 1999 - September 29th, 2000. 

Before we present the results of the empirical analysis observe that R/S and DFA statistics 
require that length L of the data vector has as many divisors as possible. In three cases we had 
to reduce the original number of logarithmic returns in order to increase the number of divisors; 
for the CalPX and Nord Pool markets we selected the first 728 (out of 730) returns and estimated 
the Hurst exponent using subscrics of length n = 52,56,91,104,182 and 364; for the Entergy 
market we selected the first 690 (out of 694) returns and estimated H using subserics of length 
n = 69, 115, 138, 230 and 345. In the case of the U.K. market we only had 294 returns. Wc decided 
to use all subscrics of length n > 10, i.e. 14, 21, 42, 49, 98 and 147, and to calculate only the DFA 
statistics (since the rescaled range statistics yields large estimation errors for small n). To keep 
consistency, periodogram regression (GPH) estimates were obtained for the same data sets. 

Results of the long-range dependence analysis for returns of all four time series are reported in 
Table 1. Hurst exponent H estimates are given together with their significance at the (two-sided) 
90%, 95% or 99% level. Looking at the table we can classify the power markets into two categories: 
1° those where electricity price processes exhibit a strong mean-reverting mechanism and 2° those 
where electricity prices behave almost like Brownian motion. The California and Entergy markets 
fall into the first category, whereas the Scandinavian market behaves in a more random walk like 
fashion. Unfortunately, the short length of the fourth data set makes the results highly questionable 
and does not allow us to assign the U.K. day-ahead spot market to any category. 

To test if these results are an artifact of the seasonality in the electricity price process we 
applied a technique proposed in Weron et al. (2001) to remove the weekly and annual cycles in 
the two longest time series (CalPX and Nord Pool markets). The results, which are reported in 
Table 1, show that mean-reversion is not caused by seasonality. The estimated Hurst exponents for 
the California market are almost identical to the original ones. In the case of the Nord Pool data 



Table 1. Estimates of the Hurst exponent H for original and deseasonalized data 



Data 


R/S-AL 


Method 
DFA 


GPH 


Original data 








CalPX 


0.3473* 


0.2633*** 


0.0667*** 


Nord Pool 


0.4923 


0.4148 


0.1767** 


Entergy 


0.2995** 


0.3651** 


0.0218*** 


U.K. spot 




0.1330***''* 


0.1623* 


Deseasonalized data 

CalPX 

Nord Pool 


0.3259* 
0.5087 


0.2529*** 
0.4872 


0.1336** 
0.3619 



*, ** and *** denote significance at the (two-sided) 90%, 95% and 99% level, respectively. For the R/S-AL 
and DFA statistics inference is based on empirical Monte Carlo results of Weron (2001), whereas for the 

GPH statistics - on asymptotic distribution of the estimate of H . 

* Due to the small number of data points the DFA statistics for U.K. spot prices was calculated using 
subseries of length n > 10. 

the changes are also not substantial (except for the GPH estimate) and allow to reject long-range 
dependence. This random walk like behavior of prices is probably caused by the fact that the 
Scandinavian market is more stable than the U.S. or U.K. markets, with the majority of electricity 
being produced "on-demand" by hydro-storage power plants. 

5. Conclussions 

Our investigation of the power markets shows that there is strong evidence for mean-reversion in the 
returns series, which - what is important - is not an artifact of the seasonality in the electricity price 
process. This feature distinguishes electricity markets from the majority of financial or commodity 
markets, where there is no evidence for long-range dependence in the returns themselves. This 
situation calls for new models of electricity price dynamics. Simple continuous-time models were 
discussed in Weron et al. (2001), but surely more work has to be done in this interesting area. 
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